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A  Model  for  The  Processing  of  Position  Inforaation  in  the  Huaan  Visual 
Systea 

le  present  a  aodel  for  the  processing  of  positional  inforaation  in  the 
huaan  visual  systea,  with  particular  eaphasis  on  visual  tasks  that  involve 
the  aeasureaent  of  spatial  separation.  The  aodel  is  in  aany  respects  a 
radical  departure  froa  current  thinking  about  probleas  in  vision.  Of 
particular  note  is  the  fundaaental  significance  we  attach  to  the  retinal 
photoreceptor  lattice,  considered  as  a  two-diaensional  spatial  saapling 
systea.  Mechanisas  of  neural  interpolation  are  discussed  and  hyperacuity 
is  a  natural  consequence  of  the  aodel.  Major  concerns  which  we  do  not 
address  are  questions  of  teaporal  dependence  and  the  integration  of 
binocular  inforaation.  Ve  refer  to  the  aodel  as  the  scaled  lattice 
aodel. 

The  aodel  consists  of  a  number  of  eleaents  suaaarized  in  Figure  1  as 
follows : 

-  an  external  luminance  distribution  (inage)  is  blurred  by 
convolution  with  a  point  spread  function  (or  line  spread  function  for  a  one 
dimensional  distribution)  and  then  saapled  by  a  lattice  of  retinal 
photodetectors  with  center-to-center  spacing  d  (Fig  1:  a,b,c). 

-  the  saapled  iaage  is  transmitted  to  higher  visusl  centers  where  it 
is  reconstructed  on  a  fceural  lattice  by  interpolating  between  the  saaples 
(Fig  1:  d,e) . 

-  the  effective  spacing  of  the  neural  lattice  is  sons  integer  (N) 
tines  finer  than  the  spacing  of  the  photodetectors,  (Fig  1:  the  ratio  of 
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photodetector  spacing  to  the  neural  grid  is  1/3). 

-  position  information  in  the  reconstrneted  image  is  extracted  by 
determining  the  locations  of  specific  stimulus  features  on  the  neural 
lattice,  in  particular  the  luminance  peaks,  (Fig  1:  e.f). 

-  the  separation  between  two  features  such  as  peaks  is  determined  by 
counting  the  number  of  neural  units  between  the  features  (Fig  1:  e.f.g). 

-  the  error  in  determining  a  separation  (da)  is  proportional  to  the 
positional  quantization  (spacing)  of  the  neural  lattice. 

~  the  interpolation  factor  N  (the  ratio  of  photoreceptor  spacing  to 
neural  unit  spacing)  wariea  with  the  separation  of  the  peaks  in  such  a 
way  that  the  fractional  error  in  separation  (Aa/s)  is  roughly  constant. 

The  effects  of  retinal  sampllni 

The  following  discussion  assumes  one  dimensional  images  for 
convenience.  Consider  the  external  image.  It  is  described  by  the  function 
L(x)  which  gives  the  value  of  the  luminance  at  every  point  x.  For 
convenience,  we  assume  that  x  ranges  from  —•  to  ■»,  although  the  range  is  in 
fact  finite.  The  basic  problem  is  this:  since  L(x)  is  a  continuous 
function  of  x,  an  exact  representation  of  the  function  would  require 
measuring  the  luminance  at  an  infinite  number  of  points.  This  is  clearly 
not  feasible  since  an  eye  is  constructed  with  a  finite  number  of 
photoreceptors.  Thus,  the  question  arises  of  how  well  L(x)  can  be 
represented  with  a  finite  number  of  measurements.  A  formal  answer  to  this 
question  is  found  in  the  sampling  theorem:  if  L(x)  contained  no  Fourier 
components  with  a  frequency  greater  than  fBtz,  then  L(x)  could  be 
represented  exactly  by  measuring  its  values  at  points  separated  in  apace  by 
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a  distance  of  1/ (2fatz) .  Since  L(x)  has  a  finite  width  V,  the  total 
amber  of  measurements  is  2Wfaaz,  a  finite  nnaher.  The  saapling  theorem 
also  says  that  there  is  no  way  to  represent  L(x)  with  fewer  aeasnreaents. 
Thus  saapling  L(x)  at  an  interval  of  l/(2fazz)  gives  the  aost  eoapact 
possible  encoding  of  the  bandwidth  liaited  L(x).  However,  L(x)  is  not  a 
priori  liaited  in  bandwidth.  To  apply  the  saapling  theorea,  we  anst  reaove 
all  Fonrier  components  above  fazz.  This  is  accomplished  in  the  eye  by 
convolnting  the  external  iaage  with  the  retinal  (optical)  line  spread 
function,  which  effectively  filters  out  the  high  spatial  frequency 
components.  Thus,  the  sampling  theorea  provides  a  very  useful  guide:  if 
the  line  spread  function  removes  all  Fourier  coaponents  above  fazz,  then 
saapling  the  band-liaited  function  at  intervals  of  l/(2fazz)  entails  no 
further  loss  of  inforaation.  In  principle,  the  continuous  band-liaited 
function  can  be  exactly  reconstructed  from  the  saaples  by  use  of  the 
sin(x)/x  function.  However,  it  is  not  clear  that  it  is  necessary  or  even 
desirable  to  exactly  reconstruct  the  band-liaited  function  everywhere. 

After  ail,  the  band-liaited  function  is  already  a  distorted  version  of  the 
original  luainance  function. 

For  aoae  visual  tasks  it  night  suffice  to  reconstruct  the  saapled 
iaage  exactly  only  near  the  luainance  peaks,  where  the  function  is 
quadratic  and  relatively  easy  to  reconstruct.  Indeed  certain  forms  of 
distortion  in  the  reconstruction  night  be  useful  such  as  the  edge 
distortion  discussed  below.  However,  the  saapling  theorea  gives  a  good 
estimate  of  what  is  possible,  and  we  assuae  the  basic  reasoning  is 
applicable  to  the  huaan  visual  systen:  the  line  spread  function  acts  to 
band  limit  the  external  luainance  distribution  in  such  a  way  that  the  iaage 
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falling  on  the  retina  can  be  exactly  recovered  froa  the  saapled  version  of 
the  iaage.  To  see  more  clearly  what  is  lost  and  gained  by  this  process 
consider  Fig  la  and  lb.  Fig  la  shows  the  external  iaage,  a  narrow  line 
(delta  function).  Fig  lb  shows  the  iaage  falling  on  the  retina  after 
convolution  with  the  line  spread  function.  The  original  narrow  iaage  has 
been  spread  ont  and  now  has  a  shape  dictated  by  the  line  spread  function. 
Thus  at  the  initial  stages  of  iasge  reconstruction  we  have  lost  the  ability 
to  aeasure  widths  narrower  than  the  line  spread  function,  and  also  the 
ability  to  resolve  two  lines  whose  separation  is  less  than  the  width  of  the 
line  spread  function.  Note  however  that  the  position  of  the  peak  of  the 
distribution  in  Fig  lb  coincides  exactly  with  the  position  of  the  original 
line,  and  thus  no  positional  inforaation  has  been  lost,  as  long  as  the 
peaks  of  the  external  distribution  are  well  separated.  Froa  this  point  of 
view,  the  combination  of  convolution  with  the  line  spread  function  followed 
by  retinal  sampling  has  effectively  compressed  slnost  all  the  positional 
inforaation  of  the  original  distribution  into  a  finite  nuaber  of 
measurements  for  transmission  to  the  cortex,  where  the  ability  to  recover 
the  position  of  the  original  line  in  Fig  la  will  only  be  limited  by  the 
ability  to  locate  the  peak  of  the  distribution  in  Fig  lb. 

The  above  argument  can  be  restated  without  reference  to  the  sampling 
theorea.  If  the  line  spread  function  were  infinitely  narrow,  the  luminance 
at  a  sample  point  would  oontain  information  about  the  external  iasge  only 
at  that  point.  With  a;  finite  width  line  spread  function,  the  luminance  at 
each  saaple  point  contains  information  about  the  external  iaage  extending 
over  a  range  of  positions  around  that  point.  Since  the  luainanee  at 
location  x  is  now  highly  correlated  with  the  luminance  at  x  ♦  Ax,  if  Ax  is 
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much  leas  than  the  width  of  the  line  spread  function,  there  is  no  need  to 
■easnre  the  luminance  at  x  *  Ax.  Thus  a  finite  sampling  interval  is 
adequate.  A  nsefnl  analogy  is  that  the  photoreceptors  convert  a  continuous 
distribution  into  what  is  essentially  a  histograai.  The  center  of  a  peak  on 
a  histogram  can  be  determined  quite  precisely  (to  an  accuracy  such  smaller 
than  the  bin  size)  as  long  as  the  width  of  the  peak  is  several  bins.  (If 
the  peak  fell  entirely  within  one  bin,  we  could  not  determine  its  position 
to  an  accuracy  better  than  the  bin  width).  The  line  spread  function  acts 
to  guarantee  that  the  minimum  width  of  the  peak  is  several  bins.  Ve 
emphasize  that  the  image  must  be  blurred  in  some  manner  so  that  a  narrow 
external  image  excites  at  least  three  photoreceptors.  (The  number  three 
arises  because  a  quadratic  has  three  coefficients.)  If  not,  the  peak 
position  could  not  be  recovered  from  the  sampled  measurements.  It  is 
estimated  that  in  the  central  fovea  the  line  spread  function  covers 
approximately  four  photoreceptors  and  thus  this  condition  is  met. 

Peaks  and  Position  Information 

Te  regard  luminance  peaks  (maxima)  as  important  features  in  the 
analysis  of  an  image.  Thera  are  a  number  of  reasons  to  single  out  peaks 
for  special  consideration.  One  reason  is  illustrated  in  Figs  Is  and  lb. 

As  discussed  above,  there  is  a  close  connection  between  the  peak  in  lb  and 
the  position  of  the  line  in  la.  In  fact,  regions  where  the  luminance 
rises  or  falls  continuously,  or  remains  constant,  seem  to  contain  little 
information  about  'positions'.  'Position'  generally  seems  to  be  associated 
with  a  change  in  the  luminance  slope,  and  a  peak  occurs  when  the  sign  of 
the  slope  ohanges.  Peaks  have  a  number  of  other  desirable  properties. 
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Peaks  are  regions  where  a  quadratic  approxination  provides  a  good  fit  to 
the  measurements,  and  thus  peaks  are  quite  easy  to  reconstruct  accurately. 
Since  a  quadratic  function  has  three  coefficients,  reconstructing  a 
quadratic  only  requires  input  from  three  sample  points.  In  contrast  to 
this  a  function  like  sin(x)/x  is  a  high  order  polynomial  and  requires  many 
inputs.  Similarly,  peaks  (being  local  features)  are  determined  by 
comparing  the  luminances  at  nearby  points,  and  a  peak  detector  (Fig  If) 
only  requires  input  froa  three  adjacent  neural  units.  In  contrast  to  this 
an  edge  or  'zero  crossing'  detector  requires  as  input  both  the  value  of  the 
luainance  at  each  point  and  the  value  of  'zero'  at  each  point.  The  'zero' 
point  is  not  a  local  property  of  the  inage  but  rather  vonld  have  to  be 
defined  by  averaging  the  iaage  over  a  broad  region  in  soae  Banner.  This  is 
a  considerable  complication  and  it  is  hard  to  see  how  it  could  be  done  with 
sufficient  accuracy.  Adding  a  constant  luainance  to  an  image  does  not 
affect  the  peaks,  and  since  adding  a  constant  luainance  gradient  shifts 
peaks  by  an  amount  inversely  proportional  to  the  curvature  of  the  peak, 
peaks  of  the  same  curvature  are  shifted  by  equal  amounts,  leaving  their 
separation  unchanged.  Thus  slowly  changing  luainance  backgrounds  have 
little  effect  on  the  peaks.  Another  interesting  property  of  quadratics  is 
that  the  sum  of  a  set  of  quadratics  with  equal  curvatures  but  different 
centers  will  be  a  single  quadratic  whose  peak  coincides  with  the  mean  of 
the  centers  of  the  original  quadratics.  This  indicates  that  an  unresolved 
iaage  Cone  narrower  than  the  line  spread  or  interpolation  function)  will 
have  only  a  single  peak  which  coincides  with  the  mean  (center  of  gravity) 


of  the  luainance  distribution. 


s 


Birseh,  Joy  and  Hylton,  Hon 


There  ie,  however,  at  least  one  problem  with  concentrating  on 
lsainance  peaks  for  position  inforaation.  An  edge  or  step  function 
certainly  has  a  definite  position,  bnt  there  are  no  peaks  to  Bark  it. 

There  is  an  interesting  way  around  this  problem.  Suppose  the  interpolation 
function  had  a  shape  like  that  in  Fig  2a,  which  is  ronghly  the  shape 
appropriate  for  quadratic  interpolation.  The  rest  of  Fig  2  shows  what 
happens  to  a  step  function  (2b)  after  being  convoluted  with  the  line  spread 
function  (2c)  and  then  reconstructed  (2d)  using  the  interpolation  function 
of  Fig  2a.  The  interpolation  causes  overshoots  and  undershoots,  like  Mach 
bands,  to  occur  and  thus  produces  a  peak  to  nark  the  edge.  Thus  the 
'imperfect*  interpolation  has  a  very  useful  side  effect,  the  association  of 
a  peak  with  a  luminance  edge  in  the  reconstructed  function.  Interestingly, 
these  induced  peaks  do  not  coincide  with  the  original  stinulus  edges  bnt 
are  systematically  shifted.  This  introduces  a  systematic  error  in  the 
position  of  an  edge  that  sight  be  aeasurable.  (However,  since  the  error  is 
largely  systematic,  it  can  in  principle  be  corrected  for.) 

The  interpolation  mechanism 

In  this  section  we  discuss  the  mechanism  which  performs  the 
reconstruction  of  the  sampled  image.  Ve  assume  this  process  is  linesr  in 
the  sense  that  the  output  is  a  linear  function  of  the  inputs.  Then  the 
output  can  be  considered  as  the  convolution  of  the  (sampled)  input  function 
with  an  interpolation  function.  The  interpolation  function  gives  the 
weight  assigned  to  each  input  (photoreceptor)  as  a  function  of  the  distance 
between  input  position  and  output  (neural  unit)  position.  There  are  (at 
least)  two  quite  different  methods  to  perform  this  interpolation. 
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Direct  convolntlon.  The  flret,  which  ve  cell  Interpolation  by  direct 
convolution,  ia  illustrated  in  Fig  3a.  In  this  scheme  each  neural  unit 
(circles)  receives  input  directly  from  the  nearest  photoreceptors 
(triangles).  Te  have  drawn  each  neural  nnit  with  three  inputs,  since  this 
is  the  minimum  number  required  for  quadratic  interpolation,  and  the  figure 
ahows  a  neural  lattice  with  an  interpolation  factor  (N)  of  3.  The  inputs 
are  suamed  with  appropriate  weights  to  impleaent  the  interpolation 
function.  Ve  can  see  that  this  scheae  is  characterized  by  a  large  number 
of  connections  per  photoreceptors  (3N)  and  a  large  number  of  different 
weights  (3N/2  rounded  upwards). 

Mutusl  coup 1 ins.  An  alternative  scheae  is  presented  in  Fig  3b.  Ve  call 

this  interpolation  by  mutual  coupling,  and  it  is  similar  to  existing  models 
of  lateral  inhibition  (Eatliff  and  Hartline,  1959p  Batliff,  Hartline  and 
Miller,  1963).  Again  each  neural  nnit  typically  receives  input  from  three 
sources:  itself  and  its  two  nearest  neighbors.  Each  photoreceptor  also 
provides  input  for  the  neural  nnit  immediately  below  it,  but  not  for  any 
other.  The  neural  nnits  form  a  mutually  coupled  system  obeying  a  second 
order  difference  equation,  with  the  inputs  from  the  photoreceptors 
providing  the  excitation  function.  The  solution  of  such  a  system  is  given 
by  the  convolution  of  the  excitation  function  with  the  impulse  response  of 
the  mutually  coupled  system,  which  is  exactly  the  desired  result. 

This  scheme  has  a  number  of  significant  advantages  over  the  first. 
There  is  only  one  connection  per  photoreceptor  independent  of  the 
interpolation  factor  N.  The  number  of  different  weights  is  also 
independent  of  N.  There  is  one  weight  for  the  photoreceptor  connection,  a 
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aacond  for  the  connection  between  a  neural  unit  and  itaelf,  and  a  third  for 
the  connection  between  a  neural  unit  and  ita  neareat  neighbors.  Another 
weight  would  only  be  required  if  each  neural  unit  had  to  be  connected  to 
ita  aecond-neareat  neighbora,  which  would  reault  in  a  fourth  order 
difference  equation.  Another  important  advantage  in  the  aiutual  coupling 
acheaie  is  that  all  neural  units  are  the  sane,  except  for  the  additional 
photoreceptor  input  which  aone  receive.  In  the  direct  convolution  scheme, 
the  set  of  weights  used  by  each  neural  unit  depended  on  the  position  of  the 
neural  unit  relative  to  the  photoreceptors,  which  requires  that  each  neural 
unit  'know*  its  position  to  interpolate  properly.  This  complication  does 
not  arise  in  the  second  scheme  where  all  neural  units  are  essentially  the 
same.  That  is,  for  the  direct  convolution,  the  interpolation  function  was 
explicitly  contained  in  a  set  of  weights  that  depended  on  position,  while 
in  the  mutual  coupling  case  the  interpolation  is  done  implicitly  by  the 
impulse  response  of  the  second  order  system.  There  is  one  further 
advantage  to  the  second  scheme.  Suppose  that  something  caused  the  output 
of  one  neural  unit  to  fluctuate  upwards.  Since  this  output  is  an  input  for 
the  adjacent  neural  units  and  we  estimate  that  the  relative  weight  given  to 
nearest  neighbors  is  positive  (and  approaches  unity  as  the  interpolation 
factor  increases),  these  neural  units  will  also  fluctuate  upwards,  with  the 
net  result  that  the  fluctuation  in  the  difference  between  adjacent  neural 
units  will  be  highly  suppressed.  Since  peaks  are  determined  only  by  these 
differences,  this  results  in  a  considerable  degree  of  noise  suppression  for 
the  peak  detectors,  a  suppression  that  does  not  occur  in  the  first  scheme. 

A  natural  consequence  of  the  mutual  coupling  scheme  is  that  the 
interpolation  factor  (the  ratio  of  photoreceptor  to  neural  unit  spacing) 
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aust  be  en  integer.  There  cannot  be  a  fractional  noaber  of  nenral  unit* 
between  two  photoreceptors.  The  direct  convolntion  achene  only  requires 
that  the  interpolation  factor  be  the  ratio  of  two  integers  to  guarantee 
that  the  neural  lattice  is  coherent  with  the  photoreceptor  lattice. 

However,  the  actual  coupling  systea  as  illustrated  in  Figure  3b  is 
oversinplif led  since  a  one  dinensional  systea  with  only  nearest-neighbor 
couplings  cannot  have  an  iapulse  response  as  shown  in  Figure  2a.  However, 
a  two  dinensional  systea  on  a  hexagonal  lattice  probably  could  have  such  an 
iapulse  response,  at  least  for  certain  orientations,  and  a  hexagonal  systea 
with  nearest-neighbor  and  second-nearest-neighbor  couplings  alaost 
certainly  could.  Since  the  priaate  photoreceptor  array  has  been  shown  to 
be  a  highly  regular  hexagonally  packed  lattice  (Miller,  1979p  Hirsch  and 
Hylton,  1983)  we  consider  the  actual  coupling  systea  to  be  a  viable 
notion. 

Now  let  us  consider  the  constraints  that  can  be  placed  on  the 
interpolation  function.  The  aost  important  is  that  peaks  should  be 
reconstructed  accurately,  and  this  will  be  true  of  aost  any  sysunetric 
function  of  reasonable  width  which  is  itself  quadratic  near  x  ■  0.  We 
estiaate  this  width  as  follows.  As  noted  above,  quadratic  interpolation  in 
one  diaension  requires  input  froa  at  least  three  photoreceptors.  This  sets 
a  ainiaua  'range'  of  about  -*-1.5  photoreceptors.  A  two  dinensional 
quadratic  has  six  coefficients,  and  thus  requires  at  least  six  inputs.  If 
each  photoreceptor  has  six  neighbors,  then  a  'range*  of  1.5  photoreceptors 
is  again  sufficient.  Suppose  the  range  were  greater  than  this.  If  the 
iaage  were  indeed  quadratic,  this  would  allow  a  aore  accurate  constrained 
interpolation.  However,  if  the  iaage  were  not  quadratic,  this  would  result 
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in  the  peek  et  e  particular  poaition  being  ahifted  around  by  the  luainance 
at  a  relatively  diatant  point.  If  we  aaauae  that  two  fully  resolved  lines 
ahonld  not  interfere  with  each  other,  thia  aeta  a  aaxiaua  range  roughly 
coaparable  to  the  range  of  the  line  apread  function  for  very  fine 
eeparationa.  Then,  the  interpolation  functiona  aight  look  like  Fig  2a  vith 
a  total  width  coaparable  to  the  line  apread  function,  aay  about  1  or  2 
ainutea  from  the  poaitive  peak  to  the  negative  peak.  However,  thia 
aaauaption  doea  not  aeea  to  be  correct  (i.e.,  there  are  'crowding 
effects') . 

A  aeries  of  ezperiaenta  reported  by  Vestheiaer  et  al.(1975)  aay 
provide  a  better  eatiaate  of  the  width  of  the  interpolation  function. 
Vestheiaer  has  shown  that  the  ability  of  observers  to  diacriainate  the 
vernier  offset  of  a  pair  of  vertical  lines  or  the  orientation  of  a  single 
vertical  line  is  susceptible  to  interference  froa  flanking  lines,  and  that 
the  interference  is  aaxiaua  when  the  flanking  lines  were  offset  froa  the 
target  lines  by  about  2-3  ainutea  of  arc.  If  we  attribute  the 
interference  to  a  conflict  between  the  negative  aidelobes  of  the  flanking 
lines  and  the  positive  central  lobe  of  the  target  lines  (perhaps  by  causing 
aultiple  peaks  or  soae  other  non-quadratic  effect),  this  indicates  that  the 
negative  lobes  are  about  2  -3  ainutea  froa  the  central  positive  lobe, 
reasonably  coaparable  to  the  previous  guess.  Indeed,  if  the  exact 
aechanisa  of  the  interference  were  understood,  this  experiaental  technique 

v 

aight  provide  a  fairly  direct  aethod  for  deteraining  the  exact  shape  of  the 
interpolation  function. 

As  discussed  below,  the  interpolation  factor  N  varies  in  such  a  way 
that  the  spacing  of  neural  units  and  the  jnd  in  separation  are  roughly 
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proportional  to  the  reparation  between  lnainance  peaks.  This  raises  the 
possibility  that  the  width  of  the  interpolation  function  is  not  constant 
but  rather  scales  with  peak  separation  (or  rather  with  N).  This  would 
naturally  follow  if  peaks  and  edges  tended  to  become  wider  as  the 
separation  between  thea  increased. 

Separation  aeasurinr  and  scaling 

In  this  section  we  discuss  the  process  of  aeasuring  separations 
between  peaks.  This  discussion  is  largely  based  on  results  of  an 
experiment  which  effectively  aeasured  the  fractional  jnd  in  separation 
(As/s)  as  a  function  of  the  inverse  separation  (1/s)  between  two  peaks 
(Hirsch  and  Hylton,  1982).  (The  jnd  for  discriminating  between  spatial 
frequencies  was  also  aeasured  as  a  function  of  spatial  frequency,  and  those 
results  were  also  found  to  depend  only  on  the  distance  between  two  peaks  in 
the  spatial  frequency  grating.  An  example  of  the  results  appears  in  Fig. 

4.  The  relation  between  frequency  and  aeparation  is  f  *  1/s  and  Af/f  * 
As/s).  It  was  found  that  As  tended  to  take  on  discrete  values  given  by  1/N 
tiaes  the  photoreceptor  spacing,  and  that  the  value  of  N  varied  with  s  in 
such  a  way  that  As/s  was  roughly  constant.  Specifically,  it  was  found  that 
As/s  as  a  function  of  1/s  consisted  of  segaents  of  constant  As  with  As 
given  by  ,008*/N  within  a  segaent,  and  that  the  transition  between 
different  segaents  occurred  at  intervals  that  were  equally  spaced  in  1/s 
and  coincided  with  a  aaxiaua  value  of  As/s  of  approximately  .032.  On  the 
average.  As/s  had  a  value  of  .02$. 

The  first  step  in  separation  aeasureaent  is  to  locate  the  positions  of 
the  peaks.  This  is  illustrated  in  Fig  If  where  each  neural  unit,  plus  the 


14 


Birsch,  Joy  and  Hylton,  ton 


too  adjacent  neural  units,  ia  fad  into  a  peak  detector  (equates).  The  peak 
detector  compares  the  too  successive  differencea  between  the  three  neural 
naita  and  dacidea  if  the  aiga  of  the  alopa  haa  ehaaged.  The  output  of  the 
peak  detector  ia  a  logical  aignal  indicating  that  a  peak  haa  occurred 
oithin  +1/2  neural  unit  of  the  current  position.  Thus  the  effective 
spacing  of  the  neural  naita  introduces  a  fundamental  limit  to  the  accuracy 
oith  ohich  a  peak  can  be  located,  the  positional  quantisation  error  of  the 
neural  unit  lattice.  The  location  of  the  second  peak  is  similarly 
determined  to  an  accuracy  of  +1/2  neural  unit.  The  distance  between  the 
too  peaks  ia  then  determined  to  an  accuracy  of  +1  neural  unit.  This 
reflects  the  essentially  digital  nature  of  the  position  information: 
separation  ia  measured  by  counting  the  number  of  neural  units  separating 
the  too  peaks,  and  the  counting  proceaa  ia  subject  to  a  +1  count 
uncertainty.  Alternatively,  the  quantisation  error  can  be  attributed  to 
the  'rounding'  of  the  peak  position  to  the  nearest  neural  unit  position. 

It  is  clear  that  the  limit  of  resolution  in  separation  is  proportional 
to  the  apacing  of  the  neural  units.  If  the  neural  unit  apacings  take  on 
the  values  d/N  as  described  previously,  then  the  jnd  in  separation  will  be 
proportional  to  d/N.  Experimentally,  os  find  that  As  -  ,008*/N,  where  As 
ia  defined  as  the  difference  in  separation  necessary  to  correctly 
discriminate  between  too  different  separations  75%  of  the  time.  If  oe 
assume  the  psychometric  function  is  gaussian.  then  a  75%  correct  response 
corresponds  to  a  mean  displacement  of  .68  standard  deviations.  Thus  the 
rms  error  is  .008*/(.<8)  “  1.47  *  .008*.  Anatomically,  the  photoreceptor 
apacing  d  is  estimated  to  be  about  SO  aeconds  or  .008*  Vestheimer.  1979). 
and  hence  the  rms  error  is  1.47  *  d  or  1.47  counts.  8ince  the  maximum 
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quantization  error  is  only  +1  count,  en  ras  value  of  1.47  i»  clearly 
larger  than  expected  anggeating  that  there  nay  be  other  aourcee  of  error 
which  are  proportional  to  the  neural  unit  spacing.  (Ve  would  expect  the 
ras  error  to  be  about  .5-. 7  counts  for  pure  quantization  error).  An 
alternative  explanation  aight  be  that  the  peak  detectors  are  spaced  every 
two  neural  units  rather  than  every  neural  unit,  doubling  the  quantization 
error. 

Scalinz 

Eesults  of  the  cited  experiaent  show  that  the  jnd  in  separation  As  is 
not  constant  but  rather  varies  with  the  inverse  separation  (1/s)  being 
discriainated  in  such  a  way  that  As/s  is  roughly  constant.  That  is,  the 
value  of  N  is  autoaatically  chosen  to  keep  the  neural  unit  spacing  roughly 
proportional  to  the  separation  between  the  peaks,  which  is  generally 
described  by  saying  that  As  scales  with  s. 

Suppose  two  reconstructed  peaks  are  separated  by  a  distance  s  on  a 
neural  lattice  with  spacing  Ax.  Then  the  separation  in  tents  of  neural 
units  (counts)  is  given  by  Lps/Ax  with  a  +1  neural  unit  uncertainty.  The 
fractional  uncertainty  in  separation  is  given  by  1/L  “  Ax/s.  Fro*  this  it 
can  be  seen  that  requiring  the  fractional  error  to  be  constant  is 
essentially  the  sane  as  requiring  that  the  nuaber  of  neural  units  spsnning 
the  two  peaks  (L)  be  constant.  As  s  becoaes  saall  the  fractional  error 
becoaes  large,  and  at  zone  point  reaches  a  aaxiaua  tolerable  value  of 

i  "nax  “  1/Lain*  (Nuaerically,  «*tx  is  found  to  be  .032  and  hence 

i 

|  Lain  "  30.)  To  keep  perforaanoe  acceptable  for  saaller  values  of  s  the 

value  of  Ax  aust  be  decreased.  As  discussed  above,  the  nature  of  the 

i 
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interpolation  mechanism  la  anch  that  Ax  haa  the  fora  d/N  where  d  la  the 
photoreceptor  spacing  and  N  ie  the  integer  interpolation  factor.  Then  the 
fractional  error  ia  given  by  (d/N)/s  and  the  ainiana  value  of  N  for  a  given 
i  ii  K  ■  Lain  <*/»,  rounded  upwards. 

One  question  that  night  arise  at  this  point  is  why  N  varies  at  all. 

At  first  glance  it  aight  seen  simpler  to  keep  N  at  a  large  fixed  value. 

The  aost  direct  answer  to  this  is  that  experiaents  ahow  that  N  does  vary 
and  in  fact  takes  on  the  value  required  to  keep  As/a  constant.  There  are 
two  arguments  to  justify  this  behavior.  The  first  comes  from  a 
consideration  of  the  errora  introduced  in  separation  measurement  by  errors 
in  the  spacing  of  photoreceptors.  Assume  that  the  photoreceptor  lattice  is 
constructed  by  placing  end  to  end  intervals  of  aean  width  d  and  error  o. 

The  different  intervals  are  assumed  to  be  independent  of  each  other.  Then 
after  we  have  placed  down  L  intervals  we  have  covered  a  mean  distance  of  a 
■  Ld  with  an  accumulated  error  of  w*SQRT(L)  *  o*SQRT(s/d) .  Thus  the 
accumulated  separation  error  on  the  photoreceptor  lattice  increases 
proportionally  to  SQRT(s),  and  this  error  will  exceed  any  fixed  error  as  s 
increases.  The  essential  point  is  that  there  exist  sources  of  error  which 
increase  as  a  increases  and  thus  As  must  also  increase  with  a.  If  we 
assume  that  objects  have  some  natural  width  which  increases  as  the 
separation  between  objects  increases  (i.e.,  that  the  boundaries  of  objects 
become  bigger  as  the  objects  become  bigger)  then  this  gives  another  source 
of  error  that  increases  with  a. 

The  photoreceptor  spacing  argument  can  be  inverted  to  place  a  limit  on 
the  fractional  rms  error  ia  photoreceptor  spacing  (o/d).  Te  find 
experimentally  that  the  average  jnd  in  separation  is  given  by  As  ■  «  •  a 
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where  «  ■  .025.  As  disentssd  above,  the  jnd  corresponds  to  .68  tines  the 
ms  error  in  s.  Thus  if  the  photoreceptor  spacing  error  is  not  to  dominate 
As,  we  require  that  the  contribution  of  the  photoreceptor  error  to  As  is 
lass  than  1/2  the  observed  value  of  As  or 

.68  •  o  •  SORT ( s / d )  <  1/2  •  «  •  s  or 
o/d  <  1/2  •  1/.68  •  «  •  SQRT(s/d) . 

The  average  value  of  L  *  s/d  is  1/*  •  40  requiring  that  o/d  <  .12,  a 
reasonable  number.  Assuming  a  minimum  separation  of  4  photoreceptors  (s/d 
•  4)  requires  o/d  <  .037,  a  fairly  small  number.  Perhaps  there  exists  a 
small  region  of  the  fovea  in  which  photoreceptor  spacing  error  is  small, 
and  the  measurement  of  small  separations  could  be  restricted  to  this 
region. 

As  an  alternative  to  requiring  accurately  spaced  photoreceptors,  it  is 
conceivable  that  the  system  includes  a  'self  calibration*  feature  that  can 
correct  for  errors  in  photoreceptor  spacing.  This  is  possible  since  the 
spacing  errors  cause  systematic,  position  dependent,  errors  in  s.  If  s  is 
measured  through  the  AND  gate  mechanism  illustrated  in  Fig  lg,  the 
calibrations  could  be  done  by  sweeping  a  pair  of  peaks  of  fixed  separation 
across  the  retina  and  marking  all  AND  gates  which  respond  as  belonging  to 
the  same  value  of  s.  This  limits  the  error  contribution  from  photoreceptor 
spacing  to  about  one  neural  unit  since  a  spacing  error  greater  than  this 
will  result  in  the  AND  gate  being  classified  as  responding  to  a  different 
value  of  a.  Note  that  this  error  is  still  proportional  to  the  neural  unit 
spacing.  (Similarly,  since  any  errors  in  the  interpolation  weights 
represent  systematic  errors,  it  is  possible  to  correct  them.)  However, 
while  such  corrections  are  conceivable,  we  feel  that  it  is  more  likely  that 
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the  photoreceptor  lattice  ie  constructed  with  sufficient  accuracy  to  begin 
with. 

The  second  arguaent  in  favor  of  scaling  is  that  having  a  nuaber  of 
different  N's  does  not  necessarily  increase  the  coaplexity  of  the  whole 
systea,  defined  as  the  nuaber  of  parts  required  to  build  it.  In  fact,  if 
one  atteapts  to  count  the  total  nuaber  of  neural  eleaents  required  to 
construct  a  hardware  seperation  detector,  it  turns  out  that  a  systea  with 
As/s  constant  actually  requires  vastly  fever  eleaents  than  one  with  As 
constant.  This  occurs  because  the  systea  aust  cheek  all  pairs  of  peak 
detectors  and  the  systea  with  scaling  has  many  fewer  pairs.  This  is 
illustrated  in  Fig  lg.  The  separation  detector  aust  per fora  the  operation 
'Peak(x)  AND  Peak(x  +  s) *  OE'ed  over  all  starting  positions  x.  To  see  the 
effect  of  scaling,  assuae  that  the  left  hand  peak  position  is  fixed  at  x  * 
0.  Then  the  nuaber  of  pairs  (AND  gates)  required  to  cover  all  possible 

position  for  the  right  hand  peak  froa  sB^n  to  is  given  by 

•a  ax 


•a  in 

If  we  take  As  as  the  constant  Asa^n  -  *  •  sa£n  then  the  nuaber  of 
pairs  is  (1/  «)(»-ax/‘a  £n) .  On  the  other  hand,  if  As  vsries  with  s  as 

As  ■  «  •  s,  the  integral  becoaes 

<l/«)  In  (sBgz/ sBin) . 

For  large  •Bgx^aain  «y«tea  with  scaling  requires  aany  fever  pairs 
than  the  one  without. 

The  above  analysis  corresponds  to  a  one  diaensional  systea  in  which 
the  iaage  is  required  to  be  exactly  centered  in  the  visual  field,  and  is 
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only  aeant  as  an  illustration.  The  correct  analysis  requires  knowledge  of 
how  the  neural  units  are  distributed  in  two  diaensions  as  a  function  of 
eccentricity.  Actually,  the  above  case  is  relatively  unfavorable  to  the 
scaling  aechanisa.  The  true  advantage  of  the  scaling  aechanisa  is  probably 
anch  better  than  x/ln(x).  Extending  the  above  arguaent  to  two  diaensions 
introduces  an  extra  factor  of  s/As  under  the  integral  sign.  Then  with  no 
scaling  the  number  of  eleaents  is  1/2  (1/*)*  (sBBX/eB£n)2  but  with 
scaling  the  number  is  (l/«)*  In  (•atz/sB£n),  still  only  a  logarithmic 
increase. 

It  should  be  noted  that  the  notion  of  scaling  presented  here  differs 
fundamentally  from  the  usual  notion  of  scaling  with  eccentricity.  The  aore 
conventional  view  is  that  there  exiats  a  single  non-linear  aapping  between 
the  retina  and  the  cortex,  with  a  step  in  distance  of  constant  sixe  on  the 
oortex  corresponding  to  a  step  in  distance  on  the  retina  whose  sixe 
increases  with  eccentricity.  In  contrast  to  this,  we  suggest  that  there 
exists  a  set  of  linear  aappings  froa  the  retina  to  the  oortex  with 
different  scales  or  aagnif ication  factors.  Vhile  the  different  mappings 
have  different  magnification  factors,  within  one  aap  the  magnification 
factor  is  constant.  The  physiologically  and  psychophysical ly  observed 
increase  in  aagnif ioation  factor  with  eccentricity  presumably  occurs 
because  progressively  finer  aappings  are  restricted  to  progressively 
saaller  eccentrioities.  This  corresponds  to  requiring  that  images  be 
roughly  centered  in  the  visual  field.  Then  the  boundaries  of  wider  objects 
tend  to  fall  at  larger  eccentricities  while  small  objects  lie  within  small 
eccentricities,  and  the  fine  aaps  are  only  needed  at  small  eccentricities. 
Further,  if  we  assume  that  roughly  equal  neural  resources  are  allocated  to 
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each  aap,  then  the  total  area  that  can  be  covered  by  a  sap  decreases  as  its 
fineness  increases.  As  a  function  of  eccentricity,  we  night  expect  As  to 
be  constant  over  soae  central  region  covered  by  the  finest  asp,  and  then 
increase  in  a  series  of  steps  as  the  boundaries  of  successively  coarser 
asps  are  exceeded,  with  As  roughly  proportional  to  eccentricity.  Note  that 
there  is  no  a  priori  reason  for  the  various  asps  to  be  circularly 
syaaetric.  The  sealed  linear  asps  have  trenendous  advantages  over  a 
nonlinear  asp  in  terns  of  the  siaplicity  of  iaage  analysis  and  pattern 
recognition.  It  is  also  difficult  to  accurately  aeasure  distances  on  a 
nonlinear  aap  since  this  requires  integrating  variable  aagnif ication 
factors  along  soae  path  which  is  likely  to  be  difficult  to  define. 

Scaling  at  Low  Frequencies 

The  discussion  of  scaling  above  was  basically  addressed  to  a  high 
frequency  or  hyperacuity  region  involving  neural  lattices  with  spacing 
equal  to  or  less  than  the  photoreceptor  spacing.  Given  the  advantages  of  a 
scaled  systen  over  a  non-scaled  system,  it  would  seen  quite  reasonable  to 
assume  that  the  scaling  property  is  general  and  applies  to  all  image  sixes 
or  spatial  frequencies  accessible  to  the  hunan  visual  systen,  say  angular 
sixes  of  90*  to  1/30*,  and  preliminary  neasurenents  indicate  that  this  is 
indeed  the  case.  Ve  now  discnss  how  scaling  night  be  accomplished  at  low 
frequencies.  At  high  spatial  frequencies  the  effective  neural  spacing  is 
d/N  where  d  is  the  photoreceptor  spacing  and  N  decreases  with  decreasing 
frequency  in  order  to  keep  d/N  proportional  to  s  -  1/f.  This  schene  will 
fail  eventually  simply  because  there  are  no  positive  integers  less  than  1, 
and  indeed  a  breakdown  in  the  d/N  dependence  is  observed  to  occur  below  2 
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cycles/deg,  with  As  remaining  roughly  proportional  to  a  rather  than 
becoaing  a  constant  equal  to  d.  In  fact,  experimentally  we  observe  that 
for  foveal  measurements  between  about  1  and  2  cycles/deg.  As  takes  on  the 
▼aloe  d  •  2  which  we  interpret  as  evidence  for  a  neural  lattice  with 
spacing  twice  the  photoreceptor  spacing.  Thus  at  low  frequencies  the 
function  of  the  scaling  system  is  to  reduce  an  excessively  fine  retinal 
photoreceptor  lattice  to  a  coarser  neural  lattice  which  is  presumably  more 
tractable  since  it  has  fewer  elements.  There  are  two  straightforward 
mechanisms  for  accomplishing  this,  one  possible  only  in  the  central  region 
of  the  retina  (defined  as  the  region  where  each  photoreceptor  has  its  own 
connection  to  the  cortex)  and  the  other  suitable  for  foveal  and  also 
peripheral  regions  where  there  are  many  fewer  cortical  connections  than 
photoreceptors. 

In  the  central  region  where  there  exists  one  connection  per 
photoreceptor  the  basic  task  is  simply  to  produce  a  coarser  lattice.  This 
could  be  done  simply  by  constructing  a  neural  lattice  whose  spacing  is  N 
times  coarser  (not  finer)  than  the  photoreceptor  spacing  and  using  an 
interpolation  mechanism  like  Fig.  3b  in  reverse,  with  1  neural  unit  output 
per  N  photoreceptor  inputs  rather  than  N  outputs  per  each  photoreceptor 
input,  with  a  broad  interpolation  function  to  provide  pooling  over  N 
photoreceptors  rather  than  an  interpolation  between  photoreceptors.  In 
this  case  As  takes  on  the  fora  d  *  N  as  opposed  to  the  previous  case  where 
As  was  d/N,  with  the  value  of  N  increasing  to  keep  As/s  constant  as  s 
increases.  Note  that  N  now  increases  with  a  to  keep  As/s  from  getting  too 
small,  whereas  previously  N  increased  as  1/s  increased  to  keep  As/s  from 
getting  too  large. 
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The  above  scheae  is  only  feasible  in  the  central  region  where  there 
already  anst  exist  one  cortical  connection  per  photoreceptor  to  allow  1/N 
interpolation.  In  peripheral  regions  where  the  hyperacnity  interpolation 
aechanisa  does  not  exist  it  would  be  foolish  to  transait  the  output  of 
every  photoreceptor  to  the  cortex  only  to  discard  aost  of  the  information. 

A  auch  aore  elegant  scheae  would  be  to  pool  groups  of  photoreceptors  into 
spatial  sampling  units,  effectively  foraing  large  photoreceptors,  and 
transait  only  one  output  per  each  pool  to  the  cortex.  As  long  as  the 
widths  of  the  pools  and  the  center-to-center  spacing  of  the  sampling  points 
(pool  centers)  is  such  that  a  narrow  feature  contributes  to  at  least  three 
saaple  points  one  can  then  invoke  a  cortical  interpolation  aechanisa 
identical  to  the  one  that  interpolated  between  individual  photoreceptos  to 
now  interpolate  between  the  saapling  pools.  Indeed,  the  whole  aechanisa  is 
identical  to  the  one  discussed  for  high  frequencies  except  that  the  line 
spreading  is  now  accomplished  by  retinal  pooling  rather  than  optical 
blurring,  and  the  retinal  lattice  is  formed  by  the  centers  of  the  pools 
rather  than  by  individual  photoreceptors.  The  retinal  pooling  mechanisms 
night  look  like  Fig.  3b  in  reverse,  as  above.  If  the  center~to~center 

spacing  of  the  pools  is  K  times  the  photoreceptor  spacing  d,  then  As  has 

the  fora  Kd/N  where  K  is  the  retinal  pooling  factor  and  N  is  the  cortical 
interpolation  factor. 

Preliminary  results  suggest  that  the  second  scheae  is  indeed  employed 
at  low  frequencies,  at  least  for  frequencies  between  .25  and  2  cycles/deg 
(image  sixes  between  .5*  and  4*)  in  the  fovea.  Within  this  range  As 

appears  to  have  the  fora  Kd/N  where  d  is  .008*.  K  is  8,  and  N  takes  on  the 

values  1  through  4.  (That  is,  there  are  4  segaents  to  the  Af/f  curve 


23 


Hirsch,  Joy  and  Hylton,  Eon 


between  .25  end  2c/deg.)  This  is  slightly  different  free  the  high 
frequency  region  where  N  took  on  the  wslnes  1  through  7  or  8.  A  retinal 
pooling  factor  of  8  corresponds  to  a  center-to-center  spacing  for  the 
retinal  pools  of  8  *  .008*  “  .064*  or  1/16*.  If  we  assune  that  there  are  a 
sequence  of  pool  sixes  each  8  tines  larger  than  the  last,  then  inage  sixes 
between  32*  and  4*  will  be  handled  by  pools  with  center-to’-center  spacing 
8  *  1/16*  “  .5*  and  inage  sixes  larger  than  32*  will  be  handled  by  pools 
with  center-tor-center  spacing  of  8  *  .5*  “  4*.  While  these  nunbers  are 
sonewhat  speculative,  it  is  clear  that  the  high  frequency  band  (E  -  1,  8 
segments,  2  to  32  c/deg)  plus  three  low  and  intermediate  frequency  bands  (E 
«  8,  64,  and  512  with  4  segments  each,  minimum  frequencies  2/E  cycles/deg) 
provide  a  set  of  20  neural  maps  spanning  image  sixes  from  1/32*  to  over 
180*  with  As/s  or  Af/f  never  varying  by  more  than  a  2/1  ratio.  The 
different  values  of  E  correspond  to  sampling  by  individual  photoreceptors 
(E  “  1)  and  to  sampling  by  pools  of  photoreceptors  with  center-to-center 
spacing  of  E  *  .008*  "  1/16*,  .5*,  and  4*  respectively. 

Since  large  images  tend  to  be  associated  with  high  eccentricities  we 
expect  that  there  should  be  a  correlation  between  the  distribution  of  the 
sixes  of  the  retinal  sampling  pools  and  eccentricity.  In  the  fovea  all  4 
values  of  E  might  coexist  with  the  smaller  values  of  E  disappearing  as 
eccentricity  increases  and  only  the  largest  pools  existing  at  the  highest 
eccentricities.  It  will  be  interesting  to  see  if  this  simple  scheme  can  be 
reconciled  with  the  anatomy  of  the  retina. 
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/tool  lent  ion  to  Other  Experiments 

The  discussion  above  vas  specifically  based  on  a  particnlar 
experiment,  the  discr initiation  of  the  separation  between  a  pair  of  parallel 
lines  as  a  function  of  the  separation.  Our  nodel  must  of  course  be  very 
general  if  it  is  to  be  nsefnl.  Te  present  here  an  application  of  the  nodel 
to  sone  other  experiments,  albeit  also  very  simple  ones.  Application  of 
the  model  to  more  complicated  visnal  tasks  probably  would  not  require  any 
fundamental  modification. 

Separation-discrimination  tasks  can  be  regarded  as  having  two 
independent  characterisations.  The  first  is  the  spatial  acale  a  and  the 
second  is  the  jnd  or  resolution  scale  As.  For  the  parallel  line  task  s  vas 
taken  to  be  the  perpendicular  distance  between  the  two  lines  and  As  vas  the 
jnd  in  separation  also  measured  along  the  perpendicular.  The  parallel  line 
task  is  a  special  case  in  which  the  perpendicular  separation  between  the 
lines  is  both  the  stimulus  variable  used  to  measure  As  and  also  the 
scale-setting  distance. 

A  quite  different  situation  prevails  in  the  classical  vernier 
discrimination  task  which  measures  the  jnd  for  aligning  two  pointers 
separated  by  some  gap.  In  this  case  the  spatial  scale  a  is  defined  by  the 
length  of  the  gap  between  the  two  pointers  whereas  As  is  the  jnd  lateral 
offset  between  the  pointers  and  is  measured  perpendicular  to  the  gap.  Thus 
s  for  the  vernier  discrimination  task  is  not  the  stimulus  variable  but 
rather  a  parameter  of  the  experiment.  Further,  a  and  As  are  measured  along 
orthogonal  directions.  Thus,  outside  the  context  of  our  model,  it  is  by  no 
means  obvious  that  there  should  be  a  direct  connection  between  a  and  As 
(gap  length  and  vernier  threshold).  Nevertheless,  we  predict  that  As  for 
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vernier  discriaination  should  be  the  aaae  as  As  for  line  separation 
discriaination  if  the  gap  length  annals  the  parallel  line  separation, 
neglecting  orientation  effects  discnsssd  below.  Preliainary  results 
confira  this  prediction.  The  connection  between  the  two  experiaents  can  be 
seen  aore  clearly  if  we  reduce  the  lines  in  the  two  experiaents  to  saall 
dots.  Then  the  parallel  line  tash  aeasures  As  along  the  line  connecting 
the  dots  whereas  vernier  discriaination  aeasures  As  perpendicular  to  the 
line  connecting  the  dots.  The  scale  s  in  both  cases  is  set  by  the  distance 
between  the  dots,  and  it  is  s  which  detexaines  As  for  both  directions 
(neglecting  orientation  effects).  This  exaaple  indicates  that  applying  our 
aodel  to  other  experiaents  aainly  requires  the  identification  of  the 
scale-setting  distance  s,  which  aay  not  be  easy  for  coaplicated  tasks. 

There  is  another  concern  in  applying  the  aodel  to  saall  separations. 
Much  of  the  discussion  above  was  based  on  an  experiaent  'which  effectively 
aeasured  the  ability  of  observers  to  discriainate  the  separation  between 
two  parallel  lines'.  Ve  included  the  word  'effectively'  because  that 
experiaent  aainly  aeasured  the  ability  of  observers  to  discriainate  between 
sinusoidal  spatial  frequency  gratings  of  different  spatial  frequencies.  Te 
have  shown  that  spatial  frequency  discriaination  is  the  sane  as 
discriainating  the  aeparation  between  two  lines  with  separation  s  ■  1/f, 
where  f  is  the  spatial  frequency.  However,  sinusoidal  gratings  possess  one 
sterling  virtue  that  ashes  then  wore  suitable  for  studying  saall  values  of 
a  (high  values  of  f)  than  the  corresponding  line  pair  (or  dot  pair)  would 
be.  This  virtue  is  that  no  linear  operation  (line  spread,  interpolation) 
ean  change  the  shape  of  the  grating,  but  rather  can  only  ohange  the 
contrast  and  shift  the  phase.  Thus  the  peaks  of  a  grating  of  frequency  f 
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■re  always  aeparated  by  a  *  1/ f ,  no  natter  how  aaall  a  gets  (when  it  gets 
too  aaall,  the  grating  cwjtrast  becoaes  too  low  for  the  grating  to  be 
seen).  In  contrast  to  this,  as  two  lines  get  close  together,  their 
profiles  (as  deterained  by  the  line  spread  and  interpolation  function) 
begin  to  interfere  ao  that  the  aeparation  between  the  two  peaks  will  no 
longer  siaply  be  the  separation  between  the  external  narrow  lines.  Then 
they  get  too  close  together  there  will  only  be  one  peak  and  the  two  lines 
will  appear  to  be  a  aingle  fat  line.  Thus  in  practice  there  will  often  be 
critical  ainiana  separations  and  line  lengths  in  an  experiaent  (on  the 
order  of  a  few  ainotes)  and  our  nodel  night  break  down  if  these  conditions 
are  not  net.  We  also  find  that  failing  to  randomize  the  positions  of 
atiauli  can  produce  unexpected  results,  and  that  a  repetitive  atiaulus  such 
as  a  grating  generally  gives  better  defined  segaentation  than  a  non- 
repetitive  pattern  auch  at  a  aingle  pair  of  lines,  possibly  because  the 
repetition  provides  stronger  excitation  to  the  acale  aetting  aechanisa. 

Orientation  Effects 

There  is  one  further  point  that  needs  to  be  aade  in  our  nodel  which 
does  not  appear  in  any  of  the  one-diaensional  arguments  given  previously. 
The  problea  is  the  identification  of  the  perpendicular  distance  between  two 
parallel  lines  as  the  appropriate  definition  of  their  separation.  While 
this  is  a  aiaple  and  appealing  definition  it  ia  by  no  aeans  obviously 
corract.  One  could  certainly  iaagine  that  there  existed  certain  intrinsic 
aeasureaent  directions  fixed  with  respect  to  an  observer's  head  orientation 
and  that  separations  are  always  measured  along  one  of  these  natural 
directions  rather  than  along  a  direction  defined  by  the  atiaulus  itself. 
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This  leads  to  the  possibility  that  both  a  and  As  axe  dependent  on  the 
orientation  of  the  stimulus  relative  to  the  observer.  Since  ve  have 
assigned  the  photoreceptor  lattice  a  fundamental  role  in  onr  model,  it 
would  seem  necessary  to  assume  that  the  two  dimensional  characteristics  of 
the  photoreceptor  lattice  must  be  of  importance.  The  fundamental  two- 
dimensional  characteristic  of  photoreceptors  is  their  hexagonal  packing  and 
the  likely  consequence  of  this  for  our  model  is  that  there  exist  intrinsic 
directions  for  measuring  separation  and  that  these  directions  are  eeparated 
in  orientation  by  60°.  This  suggests  that  the  effective  distance  between 
two  parallel  lines  is  the  perpendicular  distance  divided  by  the  cosine  of 
the  angle  between  the  perpendicular  direction  and  the  closest  intrinsic 
direction.  Since  this  angle  cannot  be  greater  than  30*,  the  maximum 
possible  effect  is  l/eos(30*)  -  1.15.  Thus,  the  effective  distances  may  be 
larger  than  the  perpendicular  distances  by  as  much  as  15%  (or  spatial 
frequencies  lowered  by  15%)  as  orientation  ia  varied.  Since  the  effect  is 
systematic,  it  can  in  principle  be  corrected  for  and  thus  may  not  be 
measureable.  However,  if  changing  the  orientation  caused  the  scale  setting 
separation  to  pass  through  a  transition  from  one  value  of  N  to  another,  the 
resulting  change  in  As  would  be  measureable,  and  indeed  we  have  observed 
this  effect  (Hirsch  and  Hylton,  1983). 


Stereo  Vision 


Ve  have  not  considered  stereo  vision  iu  detail.  However,  much  of  the 


model  seems  directly  transferable  to  stereo  acuity  tasks.  Ve  make  this 
argument  based  on  two  results  reported  by  Butler  and  Vestheimer  (1978). 
First,  stereo  disparity  thresholds  are  susceptible  to  interference  from 
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flanking  lint*  in  direct  analogy  to  the  intarfaranoa  of  flanking  line  a  on 
vernier  acnity  thresholds  and  line  orientation  threaholda.  Since  we 
attribnte  this  interference  to  the  interpolation  mechanism,  ve  take  this  as 
evidence  that  aaapling  and  interpolation  aa  diaenssed  above  applies  to 
stereo  vision. 

Bntler  and  Vestheiaer  (1978)  also  report  that  stereo  disparity 
thresholds  increase  aa  disparity  increases,  which  he  describes  as  a 
disparity  tuning  function.  Ve  interpret  this  aa  the  analog  of  As/s  scaling 
for  stereo  vision  where  As  is  identified  aa  the  jnd  in  disparity  and  a  is 
disparity.  Ve  suggest  that  disparity  threshold  aa  a  function  of  disparity 
is  analogous  to  line  separation  threshold  as  a  function  of  line  separation 
and  vernier  offset  threshold  as  a  function  of  gap  length.  The  rise  of 
disparity  threshold  with  disparity  is  not  the  result  of  a  disparity  tuning 
aechanisai  bnt  rather  a  consequence  of  the  ubiquitous  sealing  mechanism, 
with  disparity  identified  as  the  scale  setting  variable  for  stereo  acuity 
tasks.  This  night  be  envisioned  as  two  parallel  lattice  planes,  one  for 
each  eye,  stacked  vertically  and  registered  in  such  a  way  that  features 
with  zero  disparity  are  vertically  aligned.  Disparity  is  then  the  distance 
measured  parallel  to  the  lattices  between  a  point  in  one  lattice  and  the 
corresponding  point  in  the  other,  whereas  a  spatial  separation  is  measured 
between  two  points  in  the  same  plane,  establishing  the  analogy  between 
disparity  and  spatial  separation.  Then  disparity  thresholds  should  scale 
with  disparity  just  as  spatial  thresholds  scale  with  spatial  variables. 


tean 

Ve  have  presented  a  model  for  the  processing  of  positional  information 
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in  the  human  visual  system.  The  model  esn  be  summarized  as  follows. 

An  external  Imminence  distribution  or  image  is  blurred  by  convolution 
with  the  line  spread  function  so  that  a  narrow  line  is  spread  over  at  least 
three  photoreceptors.  The  blurred  image  on  the  retina  is  then  sampled  by 
the  photoreceptor  lattice  and  transmitted  to  higher  visual  centers.  The 
neural  represenation  of  the  image  is  constructed  by  interpolating  the 
sampled  function  onto  a  lattice  of  neural  units.  The  reconstruction 
process  is  implemented  by  convolution  with  the  interpolation  function.  The 
effective  spacing  of  elements  in  the  neural  lattice  is  finer  than  the 
spacing  of  photoreceptors  by  an  integer  factor  N.  The  neural  lattice 
epacing  determines  the  level  of  accuracy  to  which  positions  of  stimulus 
features  can  be  determined.  Higher  values  of  N  correspond  to  higher  levels 
of  spatial  resolution. 

Position  information  in  the  recontructed  image  is  extracted  by 
determining  the  locations  of  specific  stimulus  features  on  the  neural 
lattice,  in  particular  the  luminance  peaks.  The  separation  (s)  between 
features  such  as  peaks  is  determined  by  counting  the  number  of  neural 
units  between  the  features.  Hence  the  error  in  determining  a  separation 
(As)  is  proportional  to  the  positional  quantization  error  (spacing)  of  the 
neural  unit  lattice. 

The  interpolation  factor  N  (the  ratio  of  photoreceptor  spacing  to 
effective  neural  unit  spacing)  is  chosen  to  keep  the  fractional  error 
(As/s)  .approximately  constant.  More  generally,  the  features  whose 
positions  or  separations  are  being  analyzed  define  a  spatial  scale  (s) 
which  is  also  used  to  set  the  scale  for  the  spatial  resolution  (As). 
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I 

There  is  an  interesting  way  to  isaaarite  the  sampling  and  j 

i 

reconstrnction  systea.  As  discussed  above,  the  'coarse'  sampling  by  the 

i 

photoreceptors  is  the  aost  efficient  encoding  of  the  image  falling  on  the  l 

retina.  This  is  certainly  an  advantage  when  transacting  the  information  ; 

from  the  retina  to  the  cortex.  However,  in  the  cortex  the  aost  efficient 
possible  encoding  is  not  necessarily  a  convenient  representation  of  the 

image.  The  interpolation  systea  provides  a  aechanisa  for  unpacking  the  j 

encoded  information  into  a  aore  easily  analysed  fora.  The  neural  lattices 

of  different  N  represent  different  degrees  of  unpacking,  and  the  degree  of 

unpacking  selected  is  chosen  to  keep  As/s  constant.  The  neural 

representations  of  different  N  contain  no  information  that  was  not  present 

in  the  original  sampling.  They  contain  the  same  information  in  a  aore 


accessible  fora. 


32 


Hirsch,  Joy  and  Hylton,  Bon 


Figure  Captions 

FIG.  2.  A  schematic  of  the  model  is  given  in  Fig  1.  Fig  la  ehows  the 
external  luminance  distribution  which  we  have  taken  to  be  a  delta 
function.  Fig  lb  shows  the  external  image  after  convolution  with  the  line 
spread  function.  Fig  lc  illustrates  sampling  of  the  retinal  image  by 
photoreceptors.  Fig  Id  represents  the  interpolation  process,  which  accepts 
the  coarsely  sampled  input  from  the  photoreceptors  and  produces  a  more 
finely  spaced  sampling  on  the  neural  unit  lattice.  Fig  le.  Ve  have  drawn 
the  neural  unit  spacing  as  1/3  the  photoreceptor  spacing:  in  general  it 
can  be  1/N.  Fig  If  illustrates  peak  detection  on  the  neural  unit  lattice. 

A  peak  detector  (square)  requires  input  from  3  adjacent  neural  units.  Fig 
lg  illustrates  a  separation  detector:  Peak(x)  AND  Peak  (x+s)  is  OR’ed  over 
all  x. 

■IG.  2.  (a)  a  prototype  interpolation  function,  (b)  a  atep  function  that 

is  blurred  by  the  line  spread  function,  (c)  and  then  reconstructed  (d) 
using  the  interpolation  function  in  (a). 

FIG,  i  (a)  Interpol  ation  by  direct  convolution,  (b)  Interpolation  by 
mutual  coupling. 

8ee  text  for  explanation. 
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PIG.  £  Reprinted  froa  Hirech  and  Hylton,  1982  ( Fig .  1).  Fractional  jnd 
in  apatial  frequency,  Af/f,  aa  a  function  of  reference  frequency,  f,  for 
three  observers:  JH,  BA  and  MR.  The  straight  lines  passing  through  the 
origin  represent  regions  of  constant  angular  jnd.  As. 


lain 
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